Generic Association Rule Bases: Are They so Succinct?

Authors

  • Tarek Hamrouni
  • Sadok Ben Yahia
  • Engelbert Mephu Nguifo
Abstract

In knowledge mining, a growing number of works aim to define “concise and lossless” representations. One main motivation is to establish a unified framework for drastically reducing large sets of association rules. In this context, generic bases of association rules, whose backbone is the combination of the concepts of minimal generator (MG) and closed itemset (CI), have so far constituted irreducible compact nuclei of association rules. However, because a given CI may have more than one associated MG, there remains an “ideal” opening for stricter redundancy removal, even within generic bases of association rules. In this paper, we adopt the succinct system of minimal generators (SSMG), as newly redefined in [1], as an exact representation of the MG set. We then incorporate the SSMG into the framework of generic bases so as to retain only the succinct generic association rules. Next, we give a thorough formal study of the related inference mechanisms, which make it possible to derive all redundant association rules from the succinct ones. Finally, an experimental study shows that our approach eliminates, without information loss, a significant number of redundant generic association rules, and thus presents only succinct and informative rules to users.
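To make the notions of closed itemset and minimal generator more concrete, the following is a minimal brute-force sketch on a hypothetical five-transaction context (the data, function names, and the `min_support` parameter are assumptions made for illustration and do not come from the paper). It groups minimal generators by their closure and builds the exact generic rules of the form MG ⇒ (CI minus MG). It does not implement the SSMG of [1]; it only shows how several MGs sharing one CI lead to several generic rules for the same closed itemset, which is the redundancy the abstract refers to.

```python
from itertools import combinations

# Hypothetical toy context: each transaction is a set of items.
TRANSACTIONS = [
    {"a", "c", "d"},
    {"b", "c", "e"},
    {"a", "b", "c", "e"},
    {"b", "e"},
    {"a", "b", "c", "e"},
]
ITEMS = sorted(set().union(*TRANSACTIONS))


def support(itemset):
    """Number of transactions that contain every item of `itemset`."""
    return sum(1 for t in TRANSACTIONS if itemset <= t)


def closure(itemset):
    """Largest itemset shared by all transactions containing `itemset`."""
    covering = [t for t in TRANSACTIONS if itemset <= t]
    if not covering:
        # By convention, an itemset that occurs nowhere closes to all items.
        return frozenset(ITEMS)
    return frozenset(set.intersection(*covering))


def minimal_generators_by_closure(min_support=1):
    """Map each closed itemset to the list of its minimal generators."""
    groups = {}
    for size in range(len(ITEMS) + 1):
        for combo in combinations(ITEMS, size):
            x = frozenset(combo)
            s = support(x)
            if s < min_support:
                continue
            # x is a minimal generator if removing any single item
            # strictly increases the support.
            if all(support(x - {i}) > s for i in x):
                groups.setdefault(closure(x), []).append(x)
    return groups


def generic_exact_rules(groups):
    """Exact rules g => (closed - g), one per minimal generator g != closed."""
    return [
        (g, closed - g)
        for closed, gens in groups.items()
        for g in gens
        if g != closed
    ]


groups = minimal_generators_by_closure()
rules = generic_exact_rules(groups)
for premise, conclusion in rules:
    print(sorted(premise), "=>", sorted(conclusion))
```

In this toy context, the closed itemset {b, e} has two minimal generators, {b} and {e}, so the exact generic base contains one rule per generator for the same closed itemset.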


Similar resources

Revisiting Generic Bases of Association Rules

As a side effect of the unprecedented amount of data digitization, classical retrieval tools have found themselves unable to go beyond the tip of the iceberg. Data mining, in conjunction with Formal Concept Analysis, clearly promises to furnish adequate tools to do so and, especially, to derive concise, generic, and easily understandable bases of “hidden” knowledge, that can be relia...


Succinct Minimal Generators: Theoretical Foundations and Applications

In data mining applications, very large contexts are handled, which usually results in a considerably large set of frequent itemsets, even for high values of the minimum support threshold. An interesting solution then consists in applying an appropriate closure operator that structures frequent itemsets into equivalence classes, such that two itemsets belong to the same class if they appear in ...


Fuzzy Apriori Rule Extraction Using Multi-Objective Particle Swarm Optimization: The Case of Credit Scoring

Many methods have been introduced to solve the credit scoring problem, such as support vector machines, neural networks, and rule-based classifiers. Rule bases are favoured in credit decision making because of their ability to explicitly distinguish between good and bad applicants. In this paper, multi-objective particle swarm optimization is applied to optimize a fuzzy apriori rule base in credit scoring. ...


Succinct System of Minimal Generators: A Thorough Study, Limitations and New Definitions

Minimal generators (MGs) are the smallest itemsets (w.r.t. the number of items) among equivalent itemsets sharing a common set of objects, while their associated closed itemset (CI) is the largest one. The pairs composed of MGs and their associated CI divide the itemset lattice into distinct equivalence classes. Such pairs were at the origin of various works related to generic association rule base...



Publication date: 2006